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Introduction 

The accountability movement has focused a great deal of attention to 
improved decision making for teachers and administrators through effective 
use of test results. Commercial test companies provide a number of options 
to test users. Such optional services include: item analysis, items keyed 
to objectives, listing of specific pages in a number of references for 
additional study if individual items or groups of items are answered incor- 
rectly, and a number of other optional reports. 

In addition to commercial test materials, teachers cpnstruct items to 
measure attainment of objectives they have set for their students. Most of 
the development efforts of teachers and commercial publishers have been 
centered on developing paper and pencil objective test items which can be 
readily scored. Primarily from this data, the teacher makes Judgments about 
pupils. Decisions such as grouping, grading, remedifil work, new assignments, 
and promotion and retention are typical examples. Test scores are recorded 
but little other data regarding the pupils' school experience are maintained. 
Such i information as the ability to work with others, the amount of time spent 
on t|ie learning task by the student, the initiative shown In completing a 
task, determination or perseverance to complete a task, identified areas of 
pupil interest, the ability to work independently and to communicate verbally 
may or may not be included in the data used to make judgments regarding 
individual students. 

One problem prevalent with all machine scoreable paper and pencil tests 
is the time lapse between test administration and the reporting of results 
to the students. This practice continues even though research has consistently 
shown that iomediate feedback directly affects the student's future test 
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performance. The student's desire to know answers is at a high peak following 
a test. Responding inmediately to students' questions about test items 
missed may contribute to fui^ther learning and will help the teacher to 
identify questions most troublesome to the students. One of the short- 
comings associated with standardized testing programs is the minimal feedback 
to the student regarding performance except in such general terms as a 
standard score or percentile rank with little or no discussion of the test 
items. This limited type of reporting often results in top performance 
from only highly motivated test takers. 

Specifying the purposes and uses to be made of a test prior to administra- 
tion is of paramount importance. When developing or selecting a test, it should 
be predetermined how the test results will be used. Possible questions needing 
answers may include: Is the purpose of the test to provide Justification 
for a grade? Will the data be useful in diagnosing pu^il strengths and 
weaknesses? How do the results from this test compare with those in other 
areas? Why do some pupils perform less well in one area than in others? 
Are these results acceptable or can something be done to improve future 
performance? Will the results aid in identifying pupils with special 
problems? Answers to the above questions may raise additional concerns for 
the test administrator and he may have to look to other information sources 
or devise other means to obtain relevant data on student performance. 

Thifi paper is like many tests in that it may raise more questions thkn 
it answers. When one considers the »^.me administrators and teachers spend 
on selecting, developing, administering and scoring tests and the amount 
of student time required to complete the tests, then one nust be concerned 
with what happens after testing. Administrators and teachers must carefully 
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review their own use of test results, continually seek better ways to use 
the data and/or discontinue some of the testing if the information is not 
being used. Having a folder full of test scores on each student contributes 
little to student learning unless meaningful decisions can be made to improve 
future educational experiences. 

Before any formal testing gets underway, all available information on 
file should be reviewed to determine if additional data is necessary. To 
justify the test, it must be ascertained that more precise information is 
needed and that the data derived from the test will add something of significance 
to what is already known. 



Why Test? 

Teachers and administrators are constantly confronted with the need for 
accurate information on the performance of a student or a group of students • 
Information may be required for parent conferences, to identify students for 
special help or to inform the student of his progress. Information requirements 
of the teacher may be at one level while those of the administrator may be 
at another level. For example, the teacher may need to check on student 

progress in order to assign the next learning activity, while the administrator 

o 

may require data from a number of classrooms to evaluate the appropriateness 
of curriculum materials. 

Whatever the need for information, it should be clearly specified prior 
to any test administration if it is to be useful for purposes other than 
grading. Data collection should be limited to that which can be analyzed 
and interpreted by the teacher and/or the administrator. If one test 
adminiWafeion can serve the needs of both users, it not only will be more 
cost effective but will allofw additional time for instructional activities. 

Questions for Administrators and Teachers to Consider Before Adopting a 
School Testing Program 

To make effective use of test data, each decision-maker must specify 
the information needed prior to test selection and administration. Decision- 
makers may include parents, students, teachers, administrators, counselors 
and school boards. At least one unique piece of information should be 
singled out for each decision group and it should be specified how the data 
collected for one group relates to the total data collection. 

Some general questions that need answers before any data collection plan 
is implemented include: 
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I8 Information needed on individuals or groups of students? 

Do parents want to know how well their son/daughter is doing 
compared to students in other schools? 

What are the perceptions of parents, teachers and students as to 
what the school is doing? 

What are the expected outcomes by the teacher before the data is 
collected? Individual vs group. 

Norm-Referenced Testing Programs 

Specific questions on what data to collect and how to make effective 
use of the data for evaluation purposes include: 

\) Is there an open discussion with faculty and/or students about 
the needs and aims of a measurement program? 

2) How ready and willing is the staff to administer tests and 
interpret scores? 

3) Are the objectives of the testing program clearly stated? 

4) How can testing most effectively and efficiently contribute 
to the design of Improved educational experiences? 

5) Will testing after an educational experience focus on the 
information needs required for planning the next educational 
experience? 

6) Will test results be studied longitudinally with information 
gathered systematically for teacher use and fitted into a 
well designed program? 

7) Is the teacher familiar with the test instrument to be used? 

8) Is the teacher involved in the selection of the test? 

9) Is the teacher aware of the specific measurement characteristics 
of the test? 

10) What are the limitations of the test? 

11) Has the teacher read the publisher's statement of what the 
test is deaigned to measure? 

12) Has the teacher read a description and evaluation of the test 
in Buros Mental Measurement Yearbook? 

13) Has the teacher examined a specimen set of the items to be 
administered? 
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14) Has the teacher reviewed curricular objectives and subject 
content in the area(s) to be tested? 

15) Has the teacher taken the test to check content validity? 

16) What, is the nature of the pupil population to be tested? 

17) Will the test provide the student with information about 
himself in establishing realistic educational and vocational 
goals? 

18) Will the test data be useful to parents in helping them 
understand that realistic goals must be set in the education 
of their children? 

19) Will test data be used as ^ ba^is for discussion in inservice 
meetings? 

20) What is the purpose of the test? 

Plac^ent - Selective grouping according to knowledge 
or skill in some subject area. 

Diagnosis - To identify causes rather than outcomes. 

Assessment - To collect census-like data on students to 
measure attainment of objectives related 
to knowledge, skills, and understandings. 

Prediction - Predict chances for success in each course 
selected in the following year. 

Evaluation - Study performance trends of pupils in certain 

subject areas. ( 

21) What item analysis is planned following the testing administra- 
tion to possibly reorganize the course of study? 

22) What additional information will be needed to substantiate the 
test data? 

23) What type of presentation is planned so that test results will 
be reported in language clearly understandable by students 
and parents? 

24) What steps have been taken to insure test data will not be 
misused? 

25) Have any plans been developed to identify teaching strategies 
for individuals, small groups, or the entire class for those 
objectives where students do not perform up to expectations? 

1 26) Are students encouraged to question teachers and counselors 
regarding their performance on a particular test? 

27) Are students encouraged to express their feelings in writing 
about test scores and the implications for future actions on 
their behalf? 

8 
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Norm-referenced test data may be useful for administrators in making 
decisions regarding the following questions: 

1) Does the data indicate any trends in student achievement? 

2) How does local student performance compare with statewide 
performance? 

3) Do pupils denonstrate greater competency at one grade level 
than at another? 

4) Have competencies increased or decreased over the past three 
years? 

5) Have previously identified weaknesses been resolved? 

6) Do the competency levels vary from building, to building? 

7) Who and where are the pupils with special problems? 

8) What skill areas need additional human and financial resources? 

9) Should additional time be allocated to the basic skills in an 
effort to improve performance? 

\ 10) Should the expectation level for student achievement by the 

\ teaching staff and the community be revised? 

I 

Assessing a Local Testing Program 

t 

1) Who in the school system will use data from the standardized testing 

' program? Teachers^ students, counselors, administrators, parents and 

school board. 

2) What is the primary purpose of each test administration? 

3) What follow-up action was taken after the test administration last year? 
Item analysis, curriculum modifications, placement of students or similar 
type actions. 

4) Is the testing program evaluated each year? 

5) What processes are used to prepare students for a test? 

6) What provisions are made to coordinate testing efforts across grade 
levels in the school? 

7) How much teacher time (estimated) is spent in administering standardized 
tests? Include time spent on scoring, recording, and interpreting results 
to students. 

8) What other information will be put together with the test scores prior to 
O , making a judgment? 
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9) Are all students tested or is a sample population of students tested to 
"SaE^inf erences ? 

10) List all standardized tests (all types) administered last year). 

Classroom Tests 

The objectives of the test and those of the instructional program should 
be closely related. The teacher or administrator should identify crucial 
decisions where additional data is required and then select or develop a 
test to collect data to improve 4)ractices in the classroom or school. There 
are few available guidelines to direct the decision maker in this process. 

Careful thought should be given when the major purpose of testing serves 
only to justify a grade. If this is ^he primary purpose, then standards of 
performance should be expressed for a particular grade and tHese should be 
consistent from teacher to teacher and from school to school. This would 
enable students interested in getting a good gi|ade to concentrate on mastery 
of the skill or subject. Students'^no longer would ^ave to *'psych out" the 
teacher to determine what is essential to obtain a good grade. 

As test results are seldom a true indicator of what a student knows 
or doesn't know, testing for grading purposes should be done frequently 
rather than at the end of a unit. Even then the results may not provide an 
accurate picture of a student's achievement, as some students may not be 
able to demonstrate mastery by means of a paper and pencil exam. 

When the primary purpose of testing is to improve instruction, tests 
may provide information for a number of instructional and management decisions. 
Results from classroom tests may be used to place students in classes, identify 
students for additional testing and diagnosis, predicting chances for success 
in future courses or to assess student achievement before and. af ^er instruction. 
Classroom testing should contribute to student learning through better 
O planned learning activities designed to meet individual and group needs and 

10 



-6- 



\ 



ERIC 



not be administered merely to assign a grade. 

The following standards sj^ould be considered when developing, adminis- 
tering, and reporting test results: 

1) Ease of test scoring should be only one of several items to 
be considere4 whgtir^s electing or developing a test. It is 
equally of more important to select or develop a test that 
can provide diagnostic data enabling the teacher to better 
assist the learner in. mastery of the learning task. 

2) The testing program should provide a number of ways for 
pupils to demonstrate mastery such as oral reports, 
individual tests, demonstratioi^s, open book, etc. 

3) The test should contain a balance of recall and application 
items as w^ll as those requiring higher levels of thinking 
whdre the student must make ^judgment. 

4) Each test administration shouid Include one open-ended response 
item, preferably a thought prdvoking question which will require 
a student to interpret, to apply and evaluate. The item should 
encourage the student to express some idea of his own. 

5) The test should include items requiring transfer of skills 
from the classroom to real life experiences whenever possible. 

6) Test items should be based upon the material covered by . 
instruction ard students should be prepafed by being informed 
of the general purpose of the test. Purposes are to: iiiform 
parents of student progress, to diagnose strengths and weak- 
nesses of the class, or to create student awareness of needs ^ 

. and progress. * \ i 

7) Unanticipated outcomes from testing should be carefully ob- 
served. Are students frustrated, copying from other students, 
or making random responses? 

8) An adequate reward system should be provided for all students. 
Constructive comments by the teacher on all tests may be adeqt^ate. 

9) Iinnediate reporting of test results to students should be the 
rule of the day. The student is most interested in items 
answered correctly as well as items missed immediately fol- 
lowing the test. 

10) The possible benefits resulting from the test administration 
should always be weighed against the time required to carry, 
out the testing. 

11) The test data must be manageable for decision making. For 
example, an item analysis conducted following each test may 
yield more data than one teacher can work with effectively. 
The teacher may prefer to review data on a selected portion 

11 
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of the class, i.e., those students who did not score as 
well as expected or the lowefet 20 percent rather than 
attempt to analyze data from all ptudents. 

12) Observ^ation of student behavior during test administration 
is essential to gain additional insight into students. This 
information may be helpful when one iis analjfiing.a particular 
student's performance. 

Possible instructional and management decisions that may result from an 
analysis of test data: 

1) Assignment of additional drill and practice. 

2) Review! previous materials with individual or in small "groups. 

3) Review test items to reinforce concepts covered previously, 

, 4) Assignment of library resources — magazines, papers and other 
books. \ 

5) Instructor confers with individual(s) or small )groups with 
similar problems. 

6) Pa rent- teacher- student conference. 

7) Use of programmed instruction for skill development. 

8) Further diagnostic testing to identify causes for poor 
performance. 

9) Recommend student (s) for remedial help. 

10) Use other students to provide individual help or assign students 
to small group for additional help. 

11) Refer stu^nt(s) to counselor, psychologist. 

12) Retest — by using a sample of items from the original test,\ 
, ^ oral test or demonstration of proficiency in aniother way. 

Open-book test with another student. 

13) Teacher may elect to spend additional time with student(8) 
experiencing problems. 

14) Individual and/or small groups provided filmstrips, cassette 
tapes, games covering material {Resented. 

15) Offer different reward system for part or all of the students. 

16) Advance students to the next unit. 

Er|c 12 



Preparing Test Items 

The teacher should prepare test itemQ prior to introduction of a 
new unit or develop them as the unit progresses to insure that the items 
are valid for t?ib^roup taking the test. This process will Insure that 
one is testing what has been taught rather than relying on mtimory of 

what should hpve been taught. Test items should be formulated from con-- 

I 

cepts which were thoroughly covered in class. J: '?icra::y number of 

items must be answered correctly before a student may progress to the next 
instructional unit, this reqxiiremenc should be clearly specified prior to 
the test administration* 

Preparing Students for Testing 

In preparing students for testing, the teacher should explain the 
general purpose to be served by the test. For example, if the test is to 
be administered to provide information for parent conferences, students 
should be so informed. If the purpose is to assist the student in becoming 
aware of his progress, then the teacher must follow-up the test administra- 
tion through written comments or interviews with students not making the 
desired progress. 

Test Administration 

Feedback concerning successes and failures is an essential element 
in motivation* If maintaining a high degree of motivation for test 
taking is a goal of the school, then answer keys should be made avail- 
able to students immediately after the test. It has been' demonstrated 
that personal comments by the teacher regarding what the student did well 
and where improvement is desired tends to improve future performance more 

i ' 

13 



than the mere assignment of a letter grade. Student motivation is enhanced 
when he has immediate feedback^eonceming his progress. 

Purposes of ^Testing 

Tnough the overall p rpose of testing is to improve instruction, five, 
intermediate purposes to achieve this goal can be identified. These pur- 
poses are not always discrete. For example, a decision maker may use 
diagnostic test results to place low scoring individuals for small group 
instruction within a class* Using diagnostic test information may eliminate 
the "need for placement tests for this group of students. 

Each of the five intermediate purposes for testing are described 

/ 

below. 

Placement Tests 

Placement tests provide information for assigning students to groups, 
classes, or courses. The test provides a gross level achievement score 
which is used to help identify the appropriate instructional level for the 
student. Other factors which are considered in placement include: student 
age, past performance and teacher judgment. Individual pupils are classi- 
fied in relation to each other and in relation to the subject matter content. 

Assessment 

The main purpose of assessment is to compare pupil performance before 

/ 

and after instruction to determine the amount of growth or change taking 
place. Assessment tests may also be used to assess the effectiveness of 
teaching methods or instructional materials. It is concerned primarily 
with what has been learned. 
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Prediction 



In prediction, the teacher estimates the capacity of students to 
achieve success in a course elected for the following year. The teacher 
makes-fche Judgment based on the student's current performance. For example, 
it might be predicted that a good achievement test Score in general science 
would lead to later success liuphysics. This use of test results should 
be carefully checked against empirical evidence to verify the correctness 
of the assumption. 

Evaliiation 

Test results may be used to study trends or to report to the community 
on how well the students are performing. This Judgment usually includes 
comparing a school's achievement test scores with achievement scores of a 
comparable group of schools. 

Diagnosis 

Classroom tests should be administered on a regular basis to locate 
' weak spots in the new work that has been presented so corrective action can 
be taken at once. Day-by-day observations of the teacher may also provide 
insights regarding the students' problem so that action can be taken before 
it becomes serious. One objective of testing should be to help motivate 
students causing them to exert the necessary effort to master the task in 
the alloted time. 

• Specific diagnostic tests may be administered to determine where a 
weakness or deficiency exists. Diagnostic tests can be useful. in pinpoint- 
ing problems as they include a number of items for measuring each specific 
objective. A diagnostic test is designed to probe into the causes for poor 
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performance. The question needing answered must be specific, such as: "Does 
Johnny have the necessary vocabulary to read?", not "Why can't Johnny read?" 

When a teacher analyzes diagnostic test results, he must consider the 
student's rate of work, his mDtivation, the accuracy of his responses and 
the difficulty level where most of the problems occur. Other evidence which 
may be contributing to the poor performance should be reexamined. For ex- 
ample, students who work rapidly and m&ke numerous mistakes may do so due 
to carelessness, indifference or because of a lack of basic knowledge or skills, 
Further diagnosis may be in order, either through additional testing or via 
a ceacher-pupil conference. Students who work slowly but are quite accurate 
may improve their performance through well-directed practice. The teacher 
may assign two or three students to a group to meet regularly to review the 
content covered in the test. Tutorial help either from the teacher, other 
teachers, aides, or student assistants should be provided whenever possible. 

A number of approaches may be required to help students overcome learn- 
ing difficulties. Improving work-study skills may be all that is required 
in some instances, while in others emotional and motivational problems may 
have to be dealt "'ith in addition to work-study skills. Hopefully, diagnostic 
testing will enable the teacher to focus his teaching where it will produce 
the best results. 

Diagnostic tests should be carefully chosen according to the specific 
information desired. Tests will identify the typical errors a student makes 
but the teacher must analyze the test results to identify possible causes 
for the errors. The tesf. serves only as a starting point and additional 
supplementary information must be collected before an effective program can 
be designed for most students with learning difficulties. / 

To determine causes for persistent learning difficulties, the teacher 
may have to look not only at scholastic aptitude, mastery of the basic 
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skills and work-study skills, but at the home environment and the physixal 
and emotional condition of the student. 

In addition to analyzing diagnostic test results, diagnosis of learning 
difficulties should include information on a number of the following questions: 

1) Approximately how much student time was spent on the learning 
task? 

2) What types of assistance were provided to help the student 
with his problem? 

3) Who provided the assistance? 

4) What skill deficiencies must be overcome? 

Reading — vocabulary 

comp r ehens io n 
word attack skills 

Listening 
Writing 

5) Were there any unusual problems observed during the test? 

6) Which students asked an unusually large number of questions? 

7) Which students had difficulty concentrating on the test? 

8) What probe questions were used to identify the source of the 
error? 

a) Did you understand the question? 

b) Why did you give this answer? 

c) Why did you leave the question unanswered? 

d) What gave you trouble on the question? 

e) How did you study for the test? 

9) Was there any pattern evident in the questions missed? 
10) Does the student have any physical or emotional problem? 
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Summary Comments 

Whether testing requites five minutes or an hour, the cost of the test, 
the analyses of results, and the teacher's time required to administer the 
test must be considered relative to the use that can be made of the Information. 

If one of the major purposes of testing is to improve Instruction, then 
some changes in test administration are long overdue. Administering a group 
test (norm-referenced or teacher made) to a room full of students is a very 
impersonal experience as all participants are treated alike. Many schools 
profess to individualize Inotruction but the test administrations completely 
ignore individual differencea. The fact that some students work taore slowly 
than others or tl^at some are unable to work at a task for more than 15 minutes 
without some kind o^ break are not taken Into account. Oftentimes the only 
feedback from this inpersonal experience Is a number (a raw score or a per- 
centile score) and this may not come through for several days or weeks. 
Little or no personal Interaction takes place between student and teacher 
with regard to the contents of the test. Frequently much of the information 
collected in the test is already available. Research has indicated that 
formal teacher judgment usually is in close agreement with the data gathered 
from a formal test. The value of colliecting additional test Information 
should be judged by how much it improves decisions over the best possible 
decisions made without It. In terms of cost effectiveness, it might be more 
profitable for the teacher to gather extensive data on those students need- 
ing special help rather than collecting data nn all students. One must always 
keep in mind the possible unanticipated outcomes that may result from adminis- 
\ taring the test. (Does the student become less and less motivated to take 
tests? Does the student experience one failure after another? Does the 
stu^nt have Inadequate time to complete the test?) These are but a few of 

\ 18 
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questlons that should be asked by the test administrator, and answers can 

be obtained only through careful observation during the test and in analyzing 

and reporting of the results. 

Prior to any test, the possible decisions that will be made from the 
data must be identified. In addition, the teachers and administrators must 
commit the necessary time and resources to do something with the results. 

Those responsible for making decisions regarding students most take care 
to avoid becoming overly dependent upon paper and pencil testing. Other 
important learning outcomes (speaking and writing skills and work habits) 
should be observed and the information incorporated with test data before 
any Judgments are made about an individual. 



\ 
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